AITopics | restless bandit problem

0768281a05da9f27df178b5c39a51263-Paper.pdf

Neural Information Processing Systems, Apr-24-2026, 12:47:53 GMT

machine learning, reinforcement learning, whittle index, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation (0.48)
Government (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Data Science > Data Mining (0.97)
(3 more...)

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

Neural Information Processing Systems, Feb-9-2026, 06:55:51 GMT

In Restless-UCB, we present a novel method to construct offline instances, which requires only $O(N)$ time complexity (where $N$ is the number of arms), exponentially better than that of existing learning policies.

artificial intelligence, machine learning, underassumptions2, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.34)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.63)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

Neural Information Processing Systems, Feb-9-2026, 06:55:44 GMT

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report (0.34)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Data Science > Data Mining > Big Data (0.31)

89ae0fe22c47d374bc9350ef99e01685-AuthorFeedback.pdf

Neural Information Processing Systems, Feb-9-2026, 06:55:33 GMT

complexity, figure 1, time complexity, (13 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.36)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.36)

0768281a05da9f27df178b5c39a51263-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 08:44:59 GMT

neural network, neurwin, whittle index, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Brazos County > College Station (0.14)
Asia > Taiwan (0.04)
Oceania > New Zealand (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation (0.48)
Government (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

Neural Information Processing Systems, Dec-24-2025, 06:33:12 GMT

We study the online restless bandit problem, where the state of each arm evolves according to a Markov chain, and the reward of pulling an arm depends on both the pulled arm and the current state of the corresponding Markov chain. In this paper, we propose Restless-UCB, a learning policy that follows the explore-then-commit framework. In Restless-UCB, we present a novel method to construct offline instances, which requires only $O(N)$ time complexity (where $N$ is the number of arms), exponentially better than that of existing learning policies. We also prove that Restless-UCB achieves a regret upper bound of $\tilde{O}((N+M^3)T^{2/3})$, where $M$ is the Markov chain state space size and $T$ is the time horizon. Compared to existing algorithms, our result eliminates the exponential factor (in $M,N$) in the regret upper bound, due to a novel exploitation of the sparsity in transitions in general restless bandit problems. As a result, our analysis technique can also be adopted to tighten the regret bounds of existing algorithms. Finally, we conduct experiments on a real-world dataset to compare the Restless-UCB policy with state-of-the-art benchmarks. Our results show that Restless-UCB outperforms existing algorithms in regret and significantly reduces the running time.
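For readers unfamiliar with the explore-then-commit framework named in the abstract, the following is a minimal sketch of the generic idea only. It is not Restless-UCB: it assumes arms with i.i.d. rewards rather than Markov-chain states, and the names `explore_then_commit`, `make_bernoulli_pull`, and `explore_per_arm` are hypothetical, introduced here for illustration.

```python
import random


def explore_then_commit(pull, n_arms, horizon, explore_per_arm):
    """Generic explore-then-commit (assumed i.i.d. rewards, not Restless-UCB).

    Samples every arm `explore_per_arm` times, then plays the arm with the
    highest empirical mean for the remainder of the horizon.
    """
    if explore_per_arm < 1:
        raise ValueError("explore_per_arm must be at least 1")
    if n_arms * explore_per_arm > horizon:
        raise ValueError("exploration budget exceeds the horizon")

    totals = [0.0] * n_arms
    history = []  # list of (arm, reward) pairs in play order

    # Exploration phase: sample each arm in turn a fixed number of times.
    for arm in range(n_arms):
        for _ in range(explore_per_arm):
            reward = pull(arm)
            totals[arm] += reward
            history.append((arm, reward))

    # Commit phase: pick the empirically best arm and never switch again.
    means = [total / explore_per_arm for total in totals]
    best = max(range(n_arms), key=lambda arm: means[arm])
    for _ in range(horizon - n_arms * explore_per_arm):
        history.append((best, pull(best)))

    return best, history


def make_bernoulli_pull(success_probs, seed=0):
    """Hypothetical test environment: arm i pays 1.0 with probability success_probs[i]."""
    rng = random.Random(seed)

    def pull(arm):
        return 1.0 if rng.random() < success_probs[arm] else 0.0

    return pull
```

In this simplified i.i.d. setting, a total exploration length on the order of $T^{2/3}$ (for example `explore_per_arm = max(1, int(horizon ** (2 / 3)) // n_arms)`) is the usual choice that yields a gap-independent regret of order $T^{2/3}$, which is the same rate in $T$ as the bound quoted above. How Restless-UCB itself sets its exploration length is not stated in this abstract.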

efficient and low-complexity algorithm, name change, restless-ucb, (5 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.82)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

Neural Information Processing Systems, Aug-15-2025, 01:02:11 GMT

As a result, our analysis technique can also be adopted to tighten the regret bounds of existing algorithms.

bandit problem, restless bandit problem, restless-ucb, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.93)

Industry:

Education (0.46)
Telecommunications (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.43)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Communications > Networks (0.93)
Information Technology > Data Science > Data Mining > Big Data (0.30)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.30)

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

Neural Information Processing Systems, Aug-15-2025, 01:02:04 GMT

As a result, our analysis technique can also be adopted to tighten the regret bounds of existing algorithms.

bandit problem, restless bandit problem, restless-ucb, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.93)

Industry:

Education (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Communications (0.93)
Information Technology > Data Science > Data Mining > Big Data (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.32)

89ae0fe22c47d374bc9350ef99e01685-AuthorFeedback.pdf

Neural Information Processing Systems, Aug-15-2025, 01:01:53 GMT

complexity, figure 1, time complexity, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.55)

From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance

Xu, Jiamin, Nazarov, Ivan, Rastogi, Aditya, Periáñez, África, Gan, Kyra

arXiv.org Artificial Intelligence, Feb-7-2025

Online restless bandits extend classic contextual bandits by incorporating state transitions and budget constraints, representing each agent as a Markov Decision Process (MDP). This framework is crucial for finite-horizon strategic resource allocation, optimizing limited costly interventions for long-term benefits. However, learning the underlying MDP for each agent poses a major challenge in finite-horizon settings. To facilitate learning, we reformulate the problem as a scalable budgeted thresholding contextual bandit problem, carefully integrating the state transitions into the reward design and focusing on identifying agents with action benefits exceeding a threshold. We establish the optimality of an oracle greedy solution in a simple two-state setting, and propose an algorithm that achieves minimax optimal constant regret in the online multi-state setting with heterogeneous agents and knowledge of outcomes under no intervention. We numerically show that our algorithm outperforms existing online restless bandit methods, offering significant improvements in finite-horizon performance.
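To make the idea of a budgeted thresholding selection concrete, here is a minimal sketch of one plausible greedy rule: intervene on the agents whose estimated action benefit exceeds a threshold, highest benefit first, up to the budget. This is an assumption made for illustration, not the algorithm from the paper; the function name `select_agents` and its parameters are hypothetical, and the benefit estimates are taken as given.

```python
def select_agents(benefits, threshold, budget):
    """Illustrative greedy rule (an assumption, not the paper's algorithm).

    benefits  -- estimated benefit of intervening on each agent
    threshold -- only agents with benefit strictly above this are eligible
    budget    -- maximum number of agents that can receive the intervention
    Returns the indices of the chosen agents, highest benefit first.
    """
    eligible = [(b, i) for i, b in enumerate(benefits) if b > threshold]
    # Sort by descending benefit; break ties by the lower agent index.
    eligible.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in eligible[:budget]]
```

With `budget` smaller than the number of eligible agents, the rule spends the whole budget on the highest-benefit agents; with a larger budget, it leaves part of the budget unused rather than intervening on agents below the threshold.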

artificial intelligence, data mining, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2502.05145

Country: